A Metrical Model of Rhythm and Intonation for French Text-to-speech Synthesis

نویسندگان

  • Albert Di Cristo
  • Philippe Di Cristo
  • Jean Véronis
چکیده

This paper presents the prosodic component of a French text-to-speech synthesis system based on a metrical model of rhythm and intonation in which the prosodic well-formedness of utterances is governed by a set of rhythmic and morphosyntactic constraints. We first set out the theoretic basis of the generation of prosodic levels that correspond to the metrical and tonal structure of utterances. Then, we outline the implementation in our system, and, in particular, the prosodic module that produces a metrical interpretation of phrase-level parsed text, by computing relative prominence levels and generating the F0 patterns and segmental duration. This approach produces high quality results for text-tospeech synthesis at a very minimal implementation cost, and enables a realistic modelling of the prosodic variability observed in real speech.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques

One of the interesting topics on multimedia domain is concerned with empowering computer in order to speech production. Speech synthesis is granting human abilities to the computer for speech production. Data-based approach and process-based approach are the two main approaches on speech synthesis. Each approach has its varied challenges. Unit-selection speech synthesis and statistical parametr...

متن کامل

Synthesizing Elaborate Intonation Contours in Text-to-Speech for French

This paper presents a modular TTS system (called MINGUS) which exploits syntactic information contained in the input and allows additional annotation of the input in order to obtain particular intonation contours or to vary most prosodic parameters. This system is based on a tonal representation of French intonation, on a model of the interaction between syntax and prosody, and on a model of th...

متن کامل

Fully automatic prosody generator for text-to-speech

Text-to-Prosody systems based on the use of prosodic databases extracted from natural speech will be a key point for further development of new Text-to-Speech systems. This paper describes a system using such speech databases to generate the rhythm and the intonation of a French written text. The system is based on a very crude chinks ’n chunks prosodic phrasing algorithm and on a prosodic anal...

متن کامل

A stochastic model of intonation for French text-to-speech synthesis

This paper presents a stochastic model of French intonation contours for use in text-to-speech synthesis. The model has two modules, a linguistic module that generates abstract prosodic labels from text, and a phonetic module that generates an F0 curve from the abstract prosodic labels. This model differs from previous work in the abstract prosodic labels used, which can be automatically derive...

متن کامل

Improving text-to-speech synthesis

Naturalness in human speech is dependent on a number of factors and the extent to which a text-to-speech synthesis system can account for these factors in its model will be a measure of its success in the marketplace. As well as the obvious factors of rhythm and intonation there is the more difficult question of modelling the variability in human speech. This paper discusses how SPRUCE [1], a h...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997